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Abstract.  Traditional  methods  for  evaluating  the  amount  of  anonymity 
afforded  by  various  Mix  configurations  have  depended  on  either  measur- 
ing  the  size  of  the  set  of  possible  senders  of  a  particular  message  (the 
anonymity  set  size),  or  by  measuring  the  entropy  associated  with  the 
probability  distribution  of  the  messages  possible  senders.  This  report  ex¬ 
plores  further  in  detail  an  alternative  way  of  assessing  the  anonymity 
of  a  Mix  system  by  considering  the  capacity  of  a  covert  channel  from  a 
sender  behind  the  Mix  to  an  observer  of  the  Mix’s  output. 

Initial  work  considered  a  simple  model  [5],  where  an  observer  (Eve)  was 
restricted  to  counting  the  number  of  messages  leaving  a  Mix  configured 
as  a  firewall  guarding  an  enclave  with  one  malicious  sender  (Alice)  and 
some  other  naive  senders  (Clueless* ’s).  Here,  we  consider  the  case  where 
Eve  can  distinguish  between  multiple  destinations,  and  the  senders  can 
select  to  which  destination  their  message  (if  any)  is  sent  each  clock  tick. 


1  Introduction 

In  [5]  the  idea  of  measuring  the  lack  of  perfect  anonymity  (quasi-anonymity)  via 
a  covert  channel  was  initiated.  This  idea  was  formalized  in  [6].  Our  concern  in 
this  report  is  to  identify,  to  analyze  in  detail,  and  to  calculate  the  capacity  of,  the 
covert  channels  that  arise  from  the  use  of  a  Mix  [1,8]  as  an  exit  firewall  from  a 
private  enclave  (as  briefly  addressed  in  [5,  Sec.  4].)  In  general,  we  refer  to  a  covert 
channel  that  arises,  due  to  a  state  of  quasi-anonymity,  as  a  quasi- anonymous 
channel  [6].  The  quasi-anonymous  channel  also  serves  the  dual  role  of  being  a 
measure  of  the  lack  of  perfect  anonymity.  Note  that  [2]  uses  a  similar  model  for 
statistical  attacks  in  which  Eve  correlates  senders’  actions  with  observed  output. 

*  This  is  an  extended  version  of  [7].  Research  supported  by  the  Office  of  Naval  Re¬ 
search. 

Manuscript  approved  June  4, 2004. 
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2  Exit  Mix-firewall  Model 

There  are  N  +  1  senders  in  a  private  enclave.  Messages  pass  one  way  from  the 
private  enclave  to  a  set  of  M  receivers.  The  private  enclave  is  behind  a  firewall 
that  also  functions  as  a  timed  Mix  [8]  that  fires  every  tick,  t,  hence  we  call  it  a 
simple,  timed  Mix-firewall.  For  the  sake  of  simplicity  we  will  refer  to  a  simple* 
timed  Mix-firewall  as  a  Mix-firewall  in  this  report.  One  of  the  N  4-  1  senders, 
called  Alice,  is  malicious.  The  other  N  clueless  senders,  Clueless*,?  =  1, . . . ,  JV", 
are  benign.  Each  sender  may  send  at  most  one  message  per  unit  time  t  to  the 
set  of  receivers.  All  messages  from  the  private  enclave  to  the  set  of  receivers  pass 
through  public  lines  that  are  subject  to  eavesdropping  by  an  eavesdropper  called 
Eve.  The  only  action  that  Eve  can  take  is  to  count  the  number  of  messages  per 
i  going  from  the  Mix-firewall  to  each  receiver,  since  the  messages  are  otherwise 
indistinguishable.  Eve  knows  that  there  are  N+ 1  possible  senders.  The  N  clueless 
senders  act  in  an  independent  and  identical  manner  (i.i.d.)  according  to  a  fixed 
distribution  (7*,?  =  1, . . .  ,  N.  Alice,  by  sending  or  not  sending  a  message  each  t 
to  at  most  one  receiver,  affects  Eve’s  message  counts.  This  is  how  Alice  covertly 
communicates  with  Eve  via  a  quasi-anonymous  channel  [6]. 


Eve 


Fig.  1.  Exit  Mix-firewall  model  with  N  clueless  senders  and  M  distinguishable  receivers 

Alice  acts  independently  (through  ignorance  of  the  clueless  senders)  when 
deciding  to  send  a  message;  we  call  this  the  ignorance  assumption .  Alice  has  the 
same  distribution  each  t .  Between  Alice  and  the  N  clueless  senders,  there  are 
N  +  1  possible  senders  per  t7  and  there  are  M  +  1  possible  actions  per  sender 
(since  each  sender  may  or  may  not  transmit,  and  if  it  does  transmit,  it  transmits 
to  exactly  one  of  the  M  receivers). 

We  consider  Alice  to  be  the  input  to  the  quasi-anonymous  channel,  which  is 
a  proper  communications  channel  [9].  Alice  can  send  to  one  of  the  M  receivers 
or  not  send  a  message.  Thus,  we  represent  the  inputs  to  the  quasi-anonymous 
channel  by  the  M  - f- 1  input  symbols  0,1,..-  ,  M,  where  %  —  0  represents  Alice 
not  sending  a  message,  and  i  G  ,M}  represents  Alice  sending  a  message 

to  the  ith  receiver  R{.  However,  note  that  the  “receiver”  in  the  quasi-anonymous 
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channel  is  Eve.  Eve  receives  the  output  symbols  ej ,  j  =  1, . . .  ,  K.  Eve  receives  e\ 
if  no  sender  sends  a  message.  Each  other  output  symbol  in  the  quasi-anonymous 
channel  corresponds,  up  to  being  indistinguishable  by  Eve,  to  a  way  that  the 
N  - hi  senders  can  send  or  not  send,  at  most  one  message  each,  out  of  the  private 
enclave,  provided  at  least  one  sender  does  send  a  message. 

For  the  sake  of  simplicity  we  introduce  a  dummy  receiver  Rq  (not  shown 
above).  If  a  sender  does  not  send  a  message  we  consider  that  to  be  a  “message” 
to  R0.  For  N  +  1  senders  and  M  receivers,  the  output  symbol  ej  observed  by 
Eve  is  an  M  +  1  vector  where  a{  is  how  many  messages  the 

Mix-firewall  sends  to  Rf.  Of  course  it  follows  for  all  j  that  YmL o  ai  =  AT  + 1. 

The  quasi-anonymous  channel  that  we  have  been  describing  is  a  discrete 
memoryless  channel  (DMC).  We  define  the  channel  matrix  M  as  an  (M  + 1)  x  K 
matrix,  where  M [i,j]  represents  the  conditional  probability  that  Eve  observes 
the  output  symbol  ej  given  that  Alice  input  i.  We  model  the  clueless  senders 
according  to  the  i.i.d.  C{  for  each  period  of  possible  action  t : 


P(Cluelessi  doesn't  send  a  message)  —  p 


P(CluelesSi  sends  a  message  to  any  Rj,j  >  0) 


Q_ 

M 


1  ~P 
M 


where  in  keeping  with  previous  papers,  q  =  1  -p  is  the  probability  that  Clueless* 
sends  a  message  to  any  one  of  the  M  receivers.  From  above,  when  Clueless; 
does  send  a  message,  the  destination  is  uniformly  distributed  over  the  receivers 
jRi,...  ,jRm-  We  call  this  the  semi-uniformity  assumption.  Again,  keep  in 
mind  that  each  clueless  sender  has  the  same  distribution  each  t,  but  they  all  act 
independently  of  each  other. 

We  model  Alice  according  to  the  following  distribution  each  t: 


P (Alice  sends  a  message  to  Ri)  =  Xi 
Of  course,  this  tells  us  that 


M 

x0  =  P(Alice  doesn't  send  a  message)  =  1  —  ^  Xi  . 

i~  1 

We  let  A  represent  the  distribution  for  Alice’s  input  behavior,  and  we  denote  by 
E  the  distribution  of  the  output  symbols  that  Eve  receives.  Thus,  the  channel 
matrix  M  along  with  the  distribution  A  totally  determine  the  quasi-anonymous 
channel.  This  is  because  the  elements  of  M  take  the  distributions  C{  into  account, 
and  M  and  A  let  one  determine  the  distribution  E  describing  the  outputs  that 
Eve  receives,  P(Eve  receives  ej). 

Now  that  we  have  our  set-up  behind  our  exit  Mix-firewall  model,  we  may 
now  go  on  to  analyze  various  cases  in  detail. 


3  Capacity  Analyses  of  the  Exit  Mix-firewall  Model 

The  mathematics  of  the  problem  gets  quite  complex.  Therefore,  we  start  with 
some  simple  special  cases  before  attempting  to  analyze  the  problem  in  general. 
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The  mutual  information  between  A  and  E  is  given  by 

I(A,E)  =  H(A)  -  H(A\E)  =  H(E)  -  H(E\A)  =  I(E,A). 

The  capacity  of  the  quasi-anonymous  channel  is  given  by  [9] 

C  —  max  I(A,E)  , 

A 

where  the  maximization  is  over  the  different  possible  values  that  the  X{  may  take 
(of  course,  the  X{  are  still  constrained  to  represent  a  probability  distribution). 
Recall  M[i7j]  —  P(E  =  ej\A  =  f),  where  is  the  entry  in  the  ith  row 

and  jth  column  of  the  channel  matrix,  M.  To  distinguish  the  various  channel 
matrices,  we  will  adopt  the  notation  that  is  the  channel  matrix  for  N 

clueless  senders  and  M  receivers. 


3.1  One  Receiver  (M  =  1) 

Case  1  —  No  clueless  senders  and  one  receiver  (N  =  0,  M  =  1) 

Alice  is  the  only  sender,  and  there  is  only  one  receiver  R\ .  Alice  sends  either  0 
(by  not  sending  a  message)  or  1  (by  sending  a  message).  Eve  receives  either  e\  — 
(1,G)  (Alice  did  nothing)  or  e%  =  (0, 1)  (Alice  sent  a  message  to  the  receiver). 
Since  there  is  no  noise  (there  are  no  clueless  senders)  the  channel  matrix  M0.i  is 
the  2x2  identity  matrix  and  it  trivially  follows  that  P(E  —  ei)  —  x0)  and  that 
P(E  =  e2)  = 


ci  e2 


Since  x0  =  1  -  xi ,  we  see  that3  H(E)  =  -x0  logx0  -  (1  -  x$)  log(l  -  x0 ).  The 
channel  matrix  is  an  identity  matrix,  so  the  conditional  probability  distribution 
P(15|A)  is  made  up  of  zeroes  and  ones,  therefore  H(E\A)  is  identically  zero. 
Hence,  the  capacity  is  the  maximum  over  xq  of  H{E),  which  is  easily  seen  to 
be  unity4  (and  occurs  when  x0  =  1/2).  Of  course,  we  could  have  obtained  this 
capacity5  without  appealing  to  mutual  information  since  we  can  noiselessly  send 
one  bit  per  tick,  but  we  wish  to  study  the  non-trivial  cases  and  use  this  as  a 
starting  point. 

Case  2  —  N  clueless  senders  and  one  receiver  (M  =  1) 

This  case  reduces  to  the  indistinguishable  receivers  case  with  N  senders  analyzed 
in  [5]  with  both  an  exit  Mix-firewall  that  we  have  been  discussing  and  an  entry 

3  All  logarithms  are  base  2. 

4  The  units  of  capacity  are  bits  per  tide  i,  but  we  will  take  the  units  as  being  under¬ 
stood  for  the  rest  of  the  report.  Recall  that  all  symbols  take  one  t  to  pass  through 
the  channel. 

5  This  uses  Shannon’s  [9]  asymptotic  definition  of  capacity,  which  is  equivalent  for 
noiseless  channels  (in  units  of  bits  per  symbol). 
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Mix-firewall  (with  the  receivers  behind  the  latter).  Alice  can  either  send  or  not 
send  a  message,  so  the  input  alphabet  again  has  two  symbols.  Eve  observes 
N  +  2  possible  output  symbols.  That  is,  Eve  sees  e\  —  (N  4- 1,0),  e2  =  (AT,  1), 
C3  =  (N  —  1,2),  •  •  • ,  ew+2  —  (0,  N  +  1).  A  detailed  discussion  of  this  case  can 
be  found  in  [5]. 


3.2  Some  Special  Cases  for  Two  Receivers  (M  =  2) 

There  are  two  possible  receivers.  Alice  can  signal  Eve  with  an  alphabet  of  three 
symbols:  1  or  2,  if  Alice  transmits  to  Ri  or  i?2>  respectively,  or  the  symbol  0  for 
not  sending  a  message.  Let  us  analyze  the  channel  matrices  and  the  entropies 
for  different  cases  of  senders. 

The  symbol  ej  that  Eve  receives  is  an  3-tuple  of  the  form  (aj,a{,<4),  where 
a\  is  the  number  of  messages  received  by  ith  receiver.6  As  before,  the  index  i  =  0 
relates  to  Alice  not  sending  any  message.  The  elements  of  the  3-tuple  must  sum 
to  the  total  number  of  senders,  N  +  1, 

2 

Eai=iV+i- 

i= 0 

Case  3  —  No  clueless  senders  and  two  receivers  ( N  =  0,  M  —  2) 

Alice  is  the  only  sender  and  can  send  messages  to  two  possible  receivers.  The 
channel  matrix  is  trivial  and  there  is  no  anonymity  in  the  channel. 

(1,0,0)  (0,1,0)  (0,0,1) 

0/1  0  0  \ 

Mo. 2  =110  1  0  ) 

2  \  0  0  1  J 

The  subscript  0.2  represents  one  sender  (Alice  alone)  and  two  receivers.  The  3x3 
channel  matrix  M0.2  [hj]  represents  the  conditional  probability  of  Eve  receiving 
the  symbol  ey,  when  Alice  sends  to  the  receiver  Ri  (A  =  i).  ‘O’  stands  for  not 
sending  a  message. 

The  mutual  information  I  is  given  by  the  entropy  H  ( E )  describing  Eve 


I(E,A)  =  H(E)  -  ~xi  logxi  -  x2\ogx2  -  (1  -  xx  -  x2)log(l  -  xx  -  x2). 


The  capacity  of  this  noiseless  covert  channel  is  log  3  «  1.58  (at  Xi= 1/3,  i  — 
0,1,2).  For  M  —  2  this  is  the  largest  capacity,  which  we  note  corresponds  to 
zero  anonymity. 

Case  4  —  N  =  1  clueless  sender  and  M  —  2  receivers 

6  Recall  that  the  aj’s  of  the  output  symbol  are  not  directly  related  to  A,  which  denotes 
the  distribution  of  Alice. 
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Eve 


Fig.  2.  Case  4:  system  with  N  =  1  clueless  sender  and  M  =  2  receivers 


The  following  row  vector  describes  the  probabilities  of  the  possible  output  sym¬ 
bols  when  only  one  clueless  sender  is  involved. 

<1,0,0)  (0,1,0)  (0,0,1) 

(  P  9/2  q/2  ) 

The  message-set  matrix  given  below  shows  how  the  various  output  symbols  can 
be  formed.  The  rows  correspond  to  Alice?s  actions,  and  the  columns,  correspond 
to  the  actions  of  Clueless.  Row  and  column  labels  are  added  elementwise  to  form 
the  matrix  entry,  which  is  the  output  symbol  corresponding  to  the  channel  state. 

(1,0,0)  (0,1,0)  (0,0,1) 

(1,0,0)  /  (2, 0,0)  (1,1,0)  (1,0, 1>\ 

(0,1,0)  (1,1,0)  (0,2,0)  (0,1,1) 

(0,0,1)  V  (1,0,1)  (0,1,1)  (0,0,2)/ 

The  set  of  distinct  symbols  formed  in  the  matrix  cells  constitutes  the  set  of 
output  symbols  Eve  may  receive.  In  this  case,  there  are  three  repetitions  in  the 
message-set  matrix,  so  Eve  may  receive  9-3  =  6  symbols. 

Let  us  consider  the  channel  matrix. 

(2,0,0)  (1,1,0)  (1,0,1)  (0,2,0)  (0,1,1)  (0,0,2) 

0  /  p  q/2  q/2  0  0  0  \ 

Mi.2  =10  p  0  q/2  q/2  0  1 

2  \  0  0  p  0  q/2  q/2  ) 

The  3x6  channel  matrix  j]  represents  the  conditional  probability  of  Eve 

receiving  the  symbol  ej  when  Alice  sends  to  Ri-  As  noted,  the  dummy  receiver 
Ri,  corresponds  to  Alice  not  sending  to  any  receiver  (however  this  is  still  a 
transmission  to  Eve  via  the  quasi-anonymous  channel). 
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Given  the  above  channel  matrix  we  have: 


H(E)  =  -{px0  log[px0] 

+[qx  o/2  +  px  i]  log[qx0/2  +  pxi\ 

+[qxo!2  +  px2]  \og[qx0/2  +  px  2] 

+[gxi/2]  log[gxi/2]  +  [gxi/2  +  qx2/2]  log^xj/2  +  qx2j  2) 
+[9^2/2]  log[grr2/2]}. 

The  conditional  entropy  is  given  by 


h(e\a)  =  -'£ 

i= 0 


6 

p(xi)  y^/»(ej|xi)logp(eJ  |ii) 

3~  1 


=  Mp)  , 


where  h 2  (p)  denotes  the  function 


Mp)  =  “  (1  ~p)/21og((  1  ~p)/2)  -  (1  —  p)/21og((l  -p)/2)  p log p 

=  -(1  -p)log((l  —  jp)/2)  -plogp  . 

The  mutual  information  between  Alice  and  Eve  is  given  by 


Fig.  3.  Capacity  for  JV  =  1  clueless  sender  and  M  =  2  receivers 
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and  the  channel  capacity  is  given  by 

C  =  max  I  {A,  E) 

A 

—  max  -{px0\og\pxo] 

+[gar0/2+pxi]  log[9X0/2+pxi] 

+[qx0/2+px2}  log[gx0/2+px2] 

+[<Fi/2]  \og[qx1/2]+[qxi/2+qx2/2]  log[gxi/2+gx2/2] 

+[gx2/2]  log[gx2/2]}-h2(p). 

Note  that  the  maximization  is  over  x\  and  x%1  since  is  determined  by 
these  two  probabilities  (holds  for  any  N).  This  equation  is  very  difficult  to  solve 
analytically  and  requires  numerical  techniques.  Figure  3  shows  the  capacity  for 
this  case  with  the  curve  N  =  1.  From  the  plot  the  minimum  capacity  is  approx¬ 
imately  0.92,  when  p  =  1/3.  This  is  less  than  1.58,  which  is  the  corresponding 
value  for  N  —  0  case. 

Case  5  —  N  —  2  clueless  senders  and  M  —  2  receivers 


Eve 


Fig.  4.  Case  5:  system  with  N  =  2  clueless  senders  and  M  —  2  receivers 


The  row  vector  describing  the  output  symbols  and  their  probabilities  with 
only  the  two  clueless  senders  only  is  given  by 

(2,0,0)  (1,1,0)  (1,0,1)  (0,2,0)  (0,1,1)  (0,0,2) 

(  p2  pq  pq  g2/4  g2/2  g2/4  ) 

The  symbol  (2, 0, 0)  has  probability  p2  because  both  clueless  do  not  send  a  mes- 
sage.  The  symbol  (1,1,0)  has  probability  2p(q/2)  because  either  Cluelessi  does 
not  send  a  message  and  Clueless2  sends  a  message  to  Ri  or  visa  versa.  The  other 
values  behave  similarly.  The  message  set  matrix,  which  has  the  contributions 
from  the  clueless  as  the  column  index  and  the  contributions  from  Alice  as  the 
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row  index,  is  as  follows. 

(2,0,0) 

(1,1,0) 

(1,0,1) 

(0,2,0) 

(0,1,1) 

(0,0,2) 

(1,0,0) 

( (3,0,0) 

(2,1,0) 

(2,0,1) 

(1,2,0) 

(1,1,1) 

(1,0,2) 

(0,1,0) 

(2,1,0) 

(1,2,0) 

(1,1,1) 

(0,3,0) 

(0,2,1) 

(0,1,2) 

(0,0,1) 1 

((2,0,1) 

(1,1,1) 

(1,0,2) 

(0,2,1) 

(0,1,2) 

(0,0,3) 

By  inspection  of  the  matrix,  we  notice  that  the  output  symbols  with  more  repeti¬ 
tions  will  have  higher  probability  of  being  seen  by  Eve,  when  compared  to  others. 

That  is,  output  symbol  (1,1,1)  will  have  a  greater  probability  of  being  observed 
than  (3,0,0)  or  (0,3,0) .The  probability  of  observing  a  symbol  also  depends  on 
the  probability  distribution  of  the  transmitter  over  the  receivers  (i.e.,  the  value 
of  q).  There  are  eight  repetitions  in  the  message-set  matrix,  so  the  number  of 
total  possible  symbols  Eve  may  receive  18  -  8  =  10  symbols.  The  channel  matrix 
M2. 2  is  given  below. 

(3,0,0)  (2,1,0)  (2,0,1)  (1,2,0)  (1,1,1)  (1,0,2)  (0,1,2)  (0,3,0)  (0,2,1)  (0,0,3) 

0  /  P2  PQ  pg  q2/  4  q2/ 2  q2 / 4  0  0  0  0  \ 

M2.2  =  1(0  p2  0  pq  pq  0  q2  /  4  q2  /  4  q2  f  2  0  I 

2  \  0  0  p2  0  pq  pq  q2  f2  0  q2  /  4  q2/4  ) 

The  3  x  10  channel  matrix  M2.2^,  j]  represents  the  conditional  probability  of 
Eve  receiving  ej  when  Alice  sends  a  message  to  receiver  Ri . 


Fig.  5.  Capacity  for  N  =  2  clueless  senders  and  M  =  2  receivers 
Figure  5  shows  the  capacity  for  this  case  N  =  2.  Again,  the  minimum  capacity 
is  found  at  p  =  1/3  =  1  /(M  +  1).  From  the  plot  the  minimum  capacity  is 
approximately  0.62,  when  p  =  1/3. 
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3.3  Some  Special  Cases  for  Three  Receivers  (M  ~  3) 

Case  6  —  system  with  N  =  1  clueless  senders  and  M  =  3  receivers 
Alice  or  Clueless  can  send  to  three  possible  receivers  or  refrain  from  sending 
(denoted  by  ‘O’).  The  probabilities  of  the  various  output  symbols  from  the  one 
clueless  sender  are  given  below. 

<1,0, 0,0)  <0,1, 0,0)  <0,0, 1,0)  <0,0, 0,1) 

(  P  9/3  9/3  9/ 3  ) 


Eve 


Fig.  6.  Case  6:  system  with  N  —  1  clueless  senders  and  M  —  3  receivers 


Now  let  us  examine  the  number  of  possible  message  set  symbols  obtained  if 
we  merge  the  individual  message  sets  of  Alice  and  Clueless. 

<1,0, 0,0)  <0,1, 0,0)  (0,0, 1,0)  <0,0, 0,1) 

<1,0, 0,0)  <2, 0,0,0)  <1,1, 0,0)  (1,0, 1,0)  <1,0, 0,1)  \ 

<0,1, 0,0)  <1,1, 0,0)  <0,2, 0,0)  <0,1, 1,0)  <0,1, 0,1) 

<0,0, 1,0)  <1,0, 1,0)  <0,1, 1,0)  <0,0, 2,0)  <0,0, 1,1) 

(0,0,0, 1)  \  <1,0,0,  <0,1, 0,1)  <0,0, 1,1)  <0,0, 0,2 )  J 

As  we  can  see  from  the  above  message-matrix,  there  are  six  repetitions  in  the 
message  sets  formed,  so  Eve  may  receive  =  16  -  6  =  10  different  symbols. 

The  channel  matrix  Mi^is  given  below. 

<2,0, 0,0}  <1, 1,0,0}  (1,0, 1,0)  <1,0,0,!)  <0,2,0, 0)  <0,1, 1,0)  <0,1, 0,1)  <0,0, 2,0)  <0,0,1, 1)  <0,0, 0,2) 

0  /  p  g/3  g/3  5/3  0  0  0  0  0 

1(0  p  0  0  5/3  5/3  5/3  0  0 

2  10  0  p  0  0  5/3  0  5/3  5/3 

3  \  0  0  0  p  0  0  5/3  0  5/3  g/3 

The  4  x  10  channel  matrix  represents  the  conditional  probability  of 

Eve  receiving  ej  when  Alice  sends  a  message  to  receiver  Ri. 
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000 


Fig.  7.  Capacity  for  N  =  1  clueless  sender  and  M  —  3  receivers 

Figure  7  shows  the  capacity  for  this  case  of  N  =  1.  The  minimum  capacity 
is  found  at  p  =  1/4  =  1/(M  +  1).  From  the  plot  the  minimum  capacity  is 
approximately  1.25,  when  p  =  1/4. 

Case  7:  system  with  N  =  2  clueless  senders  and  M  =  3  receivers 

The  row  vector  describing  how  the  clueless  users  influence  the  output  symbols 
is  given  below. 


(2,0,0, 0)  (1,1, 0,0)  (1,0, 1,0)  <1,0, 0,1)  (0,2, 0,0)  (0,1, 1,0)  (0, 1,0,1)  <0,0, 2,0)  (0,0,1, 1)  (0,0,0, 2) 

(  p2  2pqfZ  2pq/Z  2pq/Z  q2  /  9  2g2/9  2 92/9  q2  j  9  292/9  g2/9 


Now  let  us  examine  the  size  of  the  set  of  output  symbols  obtained  if  we  merge 
the  individual  message  sets  of  Alice  and  the  two  clueless  senders: 


(2, 0,0,0) 
(1,0, 0,0)  /(3, 0,0,0) 
(0, 1,0,0)  [  (2, 1,0,0) 
(0,0, 1,0)  I  (2, 0,1,0) 
(0,0, 0,1)  \(2, 0,0,1) 


(1, 1,0,0)  (1,0, 1,0) 

(2, 1,0,0)  (2,0, 1,0) 

(1,2, 0,0)  (1,1, 1,0) 

(1,1, 1,0)  (1,0, 2,0) 

(1, 1,0,1)  (1,0, 1,1) 


(1,0,0, 1)  (0,2,  0,0) 
(2, 0,0,1)  (1,2,  0,0) 
(1,1, 0,1)  (0,3, 0,0) 

(1,0, 1,1)  (0,2, 1,0) 
(1,0, 0,2)  (0,2, 0,1) 


(0,1, 1,0)  (0, 1,0,1) 

(1, 1,1,0)  (1, 1,0,1) 

(0,2, 1,0)  (0,2, 0,1) 

(0,1, 2,0)  (0,1, 1,1) 

(0,1, 1,1)  <0,1,0, 2) 


(0,0, 2,0)  (0,0, 1,1) 
(1,0, 2,0)  (1,0, 1,1) 
(0,1, 2,0)  (0,1, 1,1) 
(0,0, 3,0)  (0,0, 2,1) 

(0,0, 2,1)  (0,0, 1,2) 


As  we  can  see,  there  are  20  repetitions  in  the  symbols  formed.  Hence,  the  total 
symbols  seen  by  Eve  become  =  40  -  20  =  20  symbols. 

If  we  look  through  the  columns  (1, 1,0,0),  (0,1, 1,0)  and  (1,0, 1,0),  we  can 
find  the  element  (1, 1, 1, 0)  common  to  all  the  three  columns.  There  are  two  more 
similar  cases  for  a  common  element  in  three  columns.  From  this,  we  conclude  that 
the  message  sets  with  even  distribution  of  messages  seem  to  have  a  single  element 
common  to  many  of  the  them,  whereas  those  with  skewed  distribution  seem  to 
be  unique.  This  is  expected,  as  the  ways  to  distribute  over  several  receivers  is 
multiple,  while  there  is  only  one  way  for  all  senders  to  send  to  the  same  receiver. 


) 


(0,0, 0,2) 
<1,0,0,2)\ 
(0,1, 0,2)  ] 
(0,0, 1,2) 
(0,0, 0,3)/ 
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The  channel  matrix  (split  into  two)  is  given  below. 


(3, 0,0,0} 

(2, 1, 0,0) 

<2, 0,1,0) 

(2,0,0, 1) 

<1/2, 0,0) 

P2 

2pg/3 

2pq/ 3 

2pg/3 

92/9 

0 

Pd 

0 

0 

2j?g/3 

0 

0 

P2 

0 

0 

0 

0 

0 

P2 

0 

(0,3,0, 0} 

<0,2, 1,0} 

<0,2, 0,1) 

<0,1, 2,0) 

<0,1,0, 2) 

0 

0 

0 

0 

0 

g2/9 

H2/9 

2g2/» 

?V9 

g2/o 

0 

?2/9 

0 

2fl*/9 

0 

0 

0 

«2/9 

0 

2«2/9 

<1,0,  2,0) 

<1,0,0, 2) 

<1,1, 1,0) 

<1,1, 0,1) 

<1,0, 1,1) 

«2/9 

92/9 

2  g2/9 

2q2/9 

2g2/9 

0 

0 

2pq/3 

2pqf3 

0 

2pg/3 

0 

2p?/3 

0 

2pg/3 

0 

2pq/3 

0 

2pq/3 

2pq/3 

<0, 1,1,1) 

<0,0,  3,0) 

<0, 0,2,1) 

<0,0, 1,2} 

(0,0, 0,3} 

0 

0 

0 

0 

C 

2q2/9 

0 

0 

0 

0 

2ql/9 

«2/9 

2?2/9 
92/ 9 

g2y  9 

2g  /9 

0 

2«2/9 

0 

g2/9 

The  4  x  20  channel  matrix  M2.3  [i,j]  represents  the  conditional  probability 
of  Eve  receiving  ej  when  Alice  sends  a  message  to  receiver  jR*.  The  generalized 


Pig.  8.  Capacity  for  N  =  2  clueless  senders  and  M  =  3  receivers 

formula  for  the  matrix  elements  is  given  by 


m(3,j)  = 


2 


“o!“i!a2!(a3-l)! 
0 


■Pakq/ 3) 


2 -o' 


for  a{  =  1,2,3 
for  03  =  0 


Figure  8  shows  the  capacity  for  this  case  in  the  curve  when  N  =  2.  The  minimum 
capacity  is  found  at  p  =  1/4  =  1/(M  -f  1).  From  the  plot  the  minimum  capacity 
is  approximately  0.89,  when  p  =  1/4,  which  is  less  than  the  lowest  capacity  for 
the  N  =  1  case. 


Eve 


Ri 


R2 


Rz 


Fig.  9.  Case  7:  system  with  N  =  2  clueless  senders  and  M  =  3  receivers 


3.4  Some  Generalized  Cases 

Case  8:  iV  —  1  Clueless  and  M  receivers 


Fig.  10.  Case  8:  system  with  N  =  1  clueless  sender  and  M  receivers 
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We  generalize  the  scenario  to  one  clueless  transmitter  and  M  receivers.  The 
probability  describing  the  actions  of  only  the  one  clueless  sender  is  given  below. 

<1,0, ,o)  <0, 1,0,0,  —  ,o)  <o,o, i,o, ...  ,o)  <o,o,o,i,...  ,o)  ...  <o, 0,0,0,...  ,1) 

(  P  Q/M  g/M  q/M  ...  q/M  ) 


The  message  set  matrix  is  given  below. 


<1,0,  o,o,...  ,o) 
<o,i,o,o,...  ,o) 
<0,0, 1,0,...  ,o) 
<0,0,0,!,...  ,0} 


<0,0, 0,0,...  ,1) 


<1,0, 0,0,. 

••  ,0} 

<0, 1,0,0... 

*  ,0} 

(0,0, 1,0,. . 

-»0> 

<0, 0,0,1, 

~  .0)  * 

.  <0,0, 0,0,. 

-.1) 

/  <2, 0,0,0,. 

-  -  1 0} 

<1,1, 0,0,.. 

.,0) 

<1,0, 1,0,.. 

.,0} 

<1,0,0, 1, 

-4  * 

.  <1,0, 0,0,. 

-  ,1>\ 

<1, 1,0,0,. 

•  ■  ,0) 

<0,2, 0,0,.. 

.,0} 

(0,1, 1,0,.. 

*  >0) 

<0, 1,0,1,. 

-  <0, 1,0,0,. 

..  ,1) 

<1, 0,1,0,. 

..  ,0) 

<0,1, 1,0,.. 

.  ,0) 

(0,0, 2,0,.. 

-  ,0} 

<0,0, 1,1,. 

.  <0,0, 1,0,. 

..  ,1) 

<1,0,0, 1,. 

..,0) 

<0,1,0,!,.. 

*  *0} 

(0,0, 1,1,.. 

.,0) 

<0,0, 0,2,. 

•  <0,0,0,!,. 

..  ,1) 

<1,0, 0,0,. 

...1) 

<0,1, 0,0,.. 

-.1) 

<0,0, 1,0,.. 

<0,0,0,!,. 

.  <0,0, 0,0,. 

■■  ,2)/ 

The  number  of  output  symbols  that  may  be  seen  by  Eve  is  identical  to  the 
total  possible  distinct  pairs  in  the  message-set  matrix  shown  above.  There  are 
two  indistinguishable  transmissions  (including  null  transmissions)  and  they  are 
sent  into  M-f  1  distinct  receivers  (urns)  (this  also  includes  the  null  transmission, 
which  by  convention  goes  to  R0}  not  shown  in  the  figure).  Combinatorics  tells  us 
then  that  there  are  (M%2)  distinct  combinations  (symbols)  that  Eve  may  receive. 


The  channel  matrix  is  given  below. 

<2,  0,0,0,  ...  ,©>  <1, 1,0,0,  ...  ,0)  <1,0,1,  0,  ...  ,0} 

o  /  p  g/M 

1  ' 


q/M 

0 


<1,0,  0,0,.. 
q/M 
0 

-  0 
0 


,  1}  <0,2,0,  0,  ...  ,0) 
0 

q/M 

0 

0 


<0,0,0,  o,. . 
0 
0 
0 
0 


,2} 


M  V  0  0  0  ...  p  O  ...  q/M  / 

The  (M  + 1)  x  (M^2)  channel  matrix  j]  represents  the  conditional  prob¬ 

ability  of  Eve  receiving  ej  when  Alice  sends  a  message  to  receiver  Rz. 

The  probability  distribution  among  the  elements  of  the  channel  matrix  can 
be  calculated  by  the  formula  below. 


p4  ( q/M)N~4  :  of  ^0  V*  =  1,2,3,-*-  ,M  and]  =  1,2,3,-*- ,  (M+2) 
0  :  of  =  0 


p«-i)  (g/M)w-°o+i  :  aj  #  0  Vj  =  0, 1, 2,  •  •  -  ,  (M+2) 

0  :  4  =  0 

The  conclusions  and  more  generalizations  related  to  this  case  are  discussed 
in  the  results  section. 

Case  9:  N  clueless  senders  and  M  =  2  receivers 

In  this  case,  we  generalize  the  problem  to  N  clueless  transmitters  for  the  two 
receivers  case.  The  total  number  of  message  set  symbols  seen  by  Eve,  if  only 
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the  clueless  are  transmitting,  can  be  calculated  as  the  number  of  combinations 
in  which  N  transmitters  can  send  (or  not  send)  a  message  times  the  number  of 
combinations  in  which  the  messages  sent  can  be  distributed  into  two  receivers. 
If  k  out  of  N  transmitters  send  a  message,  then  the  k  messages  sent  can  be  di¬ 
vided  into  two  receivers  in  A;+l  possible  combinations  ((fc,  0),  (k- 1, 1), . . .  ,  (0,  k)). 

message  set  size  =  1  +  2  +  3  +  44- - h  (N  -+  2) 

TV+ 2 

=  £* 

i=0 

=  (N  +  2)(N  +  3)/2 


Eve 


Fig.  11.  Case  9:  system  with  N  clueless  senders  and  M  —  2  receivers 


The  probability  of  each  channel  state  with  clueless  only  is  as  follows. 

(TV,  0, 0)  (TV  —  1,1,0)  (TV-1,0,1)  (AT- 2,2,0)  (TV-2,1,1)  (TV-  2,0,2) 

(  pN  NpN~1p/2  NpN~1qj2  TV(TV  -  l)pN~2q2/S  N(N  —  l)pN~2q2 /4  N{N  -  l)pN~2q 

Now  let  us  merge  the  individual  message  sets  of  Alice  and  the  N  clueless 
transmitters  to  determine  the  number  of  symbols  received  by  Eve. 

(AT, 0,0)  (N  —  1,1,0)  (TV  — 1,0, 1)  (AT  — 2,2, 0)  (AT-2,1,1)  (TV-2,0,2)  ... 

(1,0,0)  /(TV  +  1,0,0)  (TV, 1,0)  (TV, 0,1)  (TV-1,2,0)  (TV-1,1,1)  (TV-1,0,2)  ... 

(0,1,0)  (TV, 1,0)  (TV-1,2, 0>  (TV-1,1,1)  (TV-2,3,0)  (TV-2,2,1)  (TV-2,1,2)  ... 

(0,0,1)  \  (TV, 0,1)  (TV-1,1,1)  (TV-1,0,2)  (TV-2,2,1)  (TV-2,1,2)  (TV-2,0,3)  ... 

As  observed  before,  the  message  set  (7V/3+1,  N/ 3,  N/ 3)  is  the  most  uniform  mes¬ 
sage  distribution.  Hence,  it  has  maximum  number  of  repetitions  in  the  message 
set  matrix  and  will  have  a  greater  probability  of  being  observed  than  (N  + 1, 0, 0) 
or  (0,1,  iV)  . 


...  (0,0,iV> 

78  (9/2)N  ) 


(0,0,  JV) 
<1,0,  N)  \ 
<0,1,  N) 
(0,0,  N+l)  ) 
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The  channel  matrix  M^}2  is  given  below, 

(N  +  1*0,0)  (N,l,  0>  0*  1>  (M-  1,2,0}  (N  —  1,1,1}  <]¥- 1*0,2}  ...  {0, 0*  N  +  1> 

o  /  P2^  /*f^-5*/2  N(iV  -  l)pN-2q2/8  ^(JV  -  I)pW“2*a/4  JVT(J\r  -  I)pN_2g2/S  ...  0 

t  I  0  pN  0  NpN~lq/2  NpN~1q/2  0  .  ..  0 

2  V  0  0  P2*  0  Np^-ig/S  Np^-^/S  ...  (q/2)^ 


The  3  x  ((IV  4*  2)(IV  +  3)/2)  channel  matrix  Mjv.2[*» i]  represents  the  condi¬ 
tional  probability  of  Eve  receiving  ej  when  Alice  sends  a  message  to  receiver  U*. 
The  probability  distribution  in  the  channel  matrix  can  be  imagined  as  nesting 
of  two  binomial  distributions:  First,  between  messages  sent  and  received;  sec¬ 
ond,  the  distribution  of  messages  sent  to  the  two  receivers.  So,  given  the  vector 
{aj,  af,  af),  the  element  of  the  channel  matrix  can  be  generalized  by  the  formula 
below. 


mo>i  =  ^ai  _  j^^°°  P r°b*  distribution  of  ( N  —  (aj  —  1))  messages  to  Ri  and  R2 

= (4-  '  4 ' 1))(’/2)*{-(«/2)4 

mjj=®y“irr5)<9/2)W" 


Note  that  a|  does  not  explicitly  appear  but  is  implicitly  in  the  above  since 
(oj  +  +  <4)  “  1  —  N}  this  relationship  will  be  seen  to  be  important  in  the 

following  general  case  (where  we  use  a  generalized  combinatorial  formula).  The 
conclusions  and  more  generalizations  related  to  this  case  are  discussed  in  the 
results  section. 


Case  10  —  N  clueless  senders  and  M  receivers 

We  now  generalize  the  problem  to  N  clueless  senders  and  M  receivers  (refer 
again  to  Figure  1).  There  are  IV  * f  1  indistinguishable  transmissions  (including 
null  transmissions)  and  they  are  sent  into  M  +  1  distinct  receivers  (urns)  (this 
also  includes  the  null  transmission,  which  by  convention  goes  to  i?0,  not  shown 
in  the  figure).  Combinatorics  tells  us  then  that  there  are  K  =  (N%^1)  possible 
symbols 

The  rows  of  our  channel  matrix  correspond  to  the  actions  of  Alice,  The 
iih  row  of  M jv.m  describes  the  conditional  probabilities  p(ej\xi)  (For  simplicity 
we  will  not  always  explicitly  note  that  j  =  1, . . .  ,  (N%% j+1)0*  By  convention  et 
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always  corresponds  to  every  sender  not  sending  a  message  (which  is  equivalent  to 
all  senders  sending  to  Rq)  .  Therefore  ei  is  the  M  + 1  tuple  (N + 1 , 0, . . .  ,  0) .  Given 
our  simplifying  semi-uniformity  assumption  for  the  clueless  senders5  distribution, 
this  term  must  be  handled  differently. 


The  first  row  of  the  channel  matrix  is  made  up  of  the  terms  Mjv.m  [Oj].  Here, 
Alice  is  not  sending  any  message  (i.e.,  she  is  “sending”  to  jRo),  so  Alice  con¬ 
tributes  one  to  the  term  aj  in  the  M  -f  1  tuple  {aJ0,a{ya{y . . .  yajM)  associated 
with  ej.  In  fact,  this  tuple  is  the  “long  hand55  representation  of  ej.  Therefore 
the  contributions  to  the  M  +  1  tuple  (aj  -  l,a£,a£, . . .  ,ajM)  describe  what  the 
N  clueless  senders  are  doing.  That  is,  a30  -  1  clueless  senders  are  not  sending 
a  message,  a{  clueless  senders  are  sending  to  Rx ,  etc.  Hence,  the  multinomial 
coefficient  (ai_l0^_  a5  )  tells  us  how  many  ways  this  may  occur.7  For  each 
such  occurrence  we  see  that  the  transmissions  to  R0  affect  the  probability  by 
pa and  the  transmissions  to  Riyi  >  0,  due  to  the  semi-uniformity  assump¬ 


tion,  contribute  ( q/M)ai .  Since  the  actions  are  independent,  the  probabilities 
multiply,  and  since  a j  -  1  4-  a{  4 - {-  a?M  =  N,  we  have  a  probability  term 

of  pao~1(q/M)N+1~ao .  Multiplying  that  term  by  the  total  number  of  ways  of 
arriving  at  that  arrangement  we  have  that: 

The  other  rows  of  the  channel  matrix  are  i  >  0.  For  row  i  >  0, 

we  have  a  combinatorial  term  (  ,  ,  *  Ji  .  ,•  ,  )  for  the  N  clueless 


senders,  a30  of  which  are  sending  to  Ro  and  N  -  a{  of  which  are  sending  to  the 
Ri,i  >  0.  Therefore,  we  see  that  under  the  uniformity  assumption, 

UN.M[i,j)  =  (ojl„5>...,oj_1>of_i,oi+l)...,a’M)poS(9/M);V-a50  ,i  >  0  . 

We  show  the  plots  of  the  mutual  information  when  the  clueless  senders  act 
(as  assumed  throughout  the  report)  in  a  semi-uniform  manner  and  when  Alice 
also  sends  in  a  semi-uniform  manner  (i.e.,  Xi  —  (1  —  x0)/M,  i  =  1,2 
We  conjecture  based  upon  our  intuition,  but  do  not  prove,  that  Alice  having 
a  semi-uniform  distribution  of  destinations  Rl9 RM  when  the  clueless  senders 
act  in  a  semi-uniform  manner  maximizes  mutual  information  (achieves  capacity). 
This  has  been  supported  by  all  of  our  numeric  computations  for  capacity.  With 
this  conjecture,  we  can  reduce  the  degrees  of  freedom  for  Alice  from  M  to  1  (her 
distribution  A  is  described  entirely  by  xo),  which  allows  greater  experimental 
and  analytical  exploration. 

The  channel  matrix  greatly  simplifies  when  both  the  clueless  senders  and 
Alice  act  in  a  totally  uniform  manner .  That  is,  when  x0  =  1/(M  +  1),  then 
Xi  =  (1  —  x0)/M  —  1/(M  +  1)  for  all  xiy  and  p  —  1/(M  +  1).  We  have 


aj  )P< 

>aAf/ 


l(q/MY 


7  The  multinomial  coefficient  is  taken  to  be  zero,  if  any  of  the  “bottom”  entries  are 
negative. 
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which  simplifies  to 


MN.M[0,  j}  =  (  ,  f  j  )(17^-r)JV  - 

\0>q  ljai3**'  j&mJ  ^  "h  ^ 

(Note  this  form  for  i  =  0  is  due  to  the  total  uniformity  of  the  C\ s.).  We  also  have 


MN,M[hj)  =  f  j  j  j  1  j  j  )pa° 

\a0Ja1?...  j%f/ 


>0 


which  simplifies  to 


=  (  j  j 
\a0y°ls 


H- 1> 


i¥ 

a?  -  15  a{ 


?  Wi+I  j  * 


*>)( 

XM/ 


M  +  l 


r^>o 
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waiiai 
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BTHiEW 
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mm 
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6 
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0.5010 

0.7434 
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0.9587 
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inisia 

7 

0.0765 

0.2048 

irenai 

[flUjjji] 
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MhMM 
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MfcEMri 
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titiWi.i 

mm 

mu 

HI 

9 

[IJKSflrd 

0.4288 
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0.6051 
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0.8463 

Table  1.  C7(1/(M  +  1))  =  lower  capacity  bounds  for  all  p?  N  =  0> .  * .  ?9S 

and  M  =  1, . . .  ,  10 

To  determine  the  distribution  E  describing  Eve  we  need  to  sum  over  the 
columns  of  the  channel  matrix  and  use  the  total  uniformity  of  A. 

P(E  =  ej)  =  '£  P(E  =  ejjA  =  i)P(A  =  *)  ,  i  =  0, . . .  ,  M  , 

i 

This  gives  us 


pm  -  *>  -  u . ,4.)  = 

Prom  this  we  can  compute  the  entropy  i?{E)  without  too  much  trouble: 

H(B>  -  e  U"+,i 0  (N,oe(M + 11  -  1oe  Uw *  is,))  • 


+  i 
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However,  the  conditional  entropy  is  more  complicated,  but  is  expressible.  There¬ 
fore,  we  wrote  Matlab  code  to  calculate  the  mutual  information,  which  is  conjec¬ 
tured  to  achieve  capacity,  when  both  the  clueless  senders  act  in  a  semi-uniform 
manner  and  Alice  acts  in  a  totally  uniform  manner.  Local  exploration  of  nearby 
points  all  yield  lower  mutual  information  values. 

Table  1  tabulates  the  results  of  numerical  calculations  of  capacities  for  dif¬ 
ferent  combinations  of  values  of  N  and  M  using  Matlab.  We  conjecture  that 
when  Alice  acts  in  a  totally  uniform  manner  (that  is  every  Alice  probability  is 
1/(M  +  1))  that  capacity  is  achieved  when  the  p  values  are  the  same,  and  this 
capacity  is  the  lower  bound  for  all  capacities.  The  table  gives  capacity  with  p 
fixed  at  1/ (M- hi),  which  we  determined  numerically  to  be  less  than  the  capacity 
for  other  values  of  p. 


4  Discussion  of  Results 


Fig.  12.  Capacity  for  N  =  1  to  4  clueless  senders  and  M  =  2  receivers 


Figure  12  shows  the  capacity  as  a  function  of  p  with  M  =  2  receivers,  for 
N  ~  1,2,3, 4  clueless  senders.  In  all  cases,  the  minimum  capacity  is  realized 
at  p  -  1/3,  and  the  capacity  at  p  =  1  is  log  3.  As  N  increases,  the  capacity 
decreases,  with  the  most  marked  effects  at  p—  1/3. 

In  Figure  12,  the  capacity  (of  course  under  the  semi-uniformity  assumption 
for  C{  which  is  in  force  throughout  the  report))  was  determined  numerically  for 
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Fig*  13.  Capacity  for  N  —  1,2,4  clueless  senders  and  M  =  3  receivers 


any  choice  of  A,  However,  for  the  remaining  plots,  we  applied  the  semi-uniformity 
conjecture  (that  Alice  is  better  off  behaving  semi-uniformly  if  that  is  what  the 
clueless  senders  do). 

Thus,  x0  is  the  only  free  variable  for  Aliceas  distribution  in  what  follows. 

Figure  13  shows  the  capacity  as  a  function  of  p  with  Af  =  3  receivers,  for 
N  =  1,2,4  clueless  senders.  As  expected,  in  all  cases,  the  minimum  capacity  is 
realized  at  p  =  1/4,  and  the  capacity  at  p  =  1  is  log  4  —  2.  As  N  increases, 
the  capacity  decreases,  with  the  most  marked  effects  at  p  —  1/4.  The  minimum 
capacity  is  greater  when  compared  to  corresponding  value  in  the  M  =  2  case 
(refer  to  plot  12). 

The  mutual  information  as  a  function  of  xQ  is  shown  in  Figure  14  for  M  =  2 
receivers  and  N  =  1  clueless  sender  for  p  =  0.25,0.33,0.5,0.67.  Here,  note  that 
the  curve  with  p  =  0.33  has  the  smallest  maximum  value  (capacity),  and  that 
the  value  of  xq  at  which  that  maximum  occurs  is  xq  =  0.33.  The  x0  value  that 
maximizes  the  mutual  information  (i.e.,  for  which  capacity  is  reached)  for  the 
other  curves  is  not  0.33,  but  the  mutual  information  at  xq  =  0.33  is  not  much 
less  than  the  capacity  for  any  of  the  curves. 

Figure  15  shows  the  mutual  information  curves  for  various  values  of  xq  as  a 
function  ofp,  with  N  =  2  clueless  senders  and  Af  =  2  receivers.  Similarly,  Figure 
16  shows  the  mutual  information  curves  for  various  values  of  x$  as  a  function  of 
p,  with  N  =  2  clueless  senders  and  Af  =  3  receivers. 

In  the  figure  15,  note  that  the  curve  for  —  l/(Af  +  l)  =  I/3has  the  largest 
minimum  mutual  information,  and  also  has  the  greatest  mutual  information  at 
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Fig.  14.  Mutual  information  vs.  x0  for  N  =  1  clueless  sender  and  M  =  2  receivers,  for 
p  =  0.25, 0.33, 0.5, 0.67 


Fig.  15.  Mutual  information  vs.  p  for  N  =  2  clueless  senders  and  M  —  2  receivers 


Pig*  16.  Mutual  information  vs.  p  for  N  =  2  clueless  senders  and  M  =  3  receivers 

the  point  where  p  =  1,  Le.,  when  there  is  no  noise  since  Cluelessi  is  not  sending 
any  messages.  The  capacity  for  various  values  of  p  is,  in  essence,  the  curve  that 
is  the  maximum  at  each  p  over  all  of  the  Xq  curves,  and  the  lower  bound  on 
capacity  occurs  at  p  —  1/3  =  1  f(M  +  1). 

Also  observe  that  the  x0  =  0.33  curve  has  the  highest  value  for  p  —  .33, 
but  for  other  values  of  p,  other  values  of  xq  have  higher  mutual  information 
(i.e.,  Alice  has  a  strategy  better  than  using  x0  =  0.33).  However,  the  mutual 
information  when  x0  —  0.33  is  never  much  less  than  the  capacity  at  any  value 
of  p,  so  in  the  absence  of  information  about  the  behavior  of  the  clueless  senders, 
a  good  strategy  for  Alice  is  to  just  use  x0  ~  1/(M  +  1).  These  observations 
are  illustrated  and  expanded  in  the  next  two  figures.  Note  the  differences  in 
concavity  between  Figure  14  and  Figure  15  ,  We  will  discuss  concavity  again 
later  in  the  report. 

Figure  17  shows  the  optimal  value  for  xq1  i.e.,  the  one  that  maximizes  mu¬ 
tual  information  and  hence,  achieves  channel  capacity,  for  N  =  1,2, 3, 4  clueless 
senders  and  M  =  3  receivers  as  a  function  of  p.  A  similar  graph  in  [5]  for 
M  =  1  receiver  is  symmetric  about  x0  =  0.5,  but  for  M  >  1  the  symmetry  is 
multidimensional,  and  the  graph  projected  to  the  (p,  £0)-plane  where  the  des¬ 
tinations  are  uniformly  distributed  is  not  symmetric.  However,  note  that  the 
optimum  choice  of  is  1/(M  +  1)  both  at  p  =  1  /(M  4-  1)  and  at  p  =  1,  that 
is,  when  the  clueless  senders  either  create  maximum  noise  or  when  they  do  not 
transmit  at  all  (no  noise).  As  N  increases,  the  optimum  Xq  for  other  values  of 
p  is  further  from  1  /(M  +  1).  Also  observe  that  Alice's  best  strategy  is  to  do 
the  opposite  of  what  the  clueless  senders  do,  up  to  a  point.  If  they  are  less 
likely  to  send  messages  (p  >  1/(M  +  1)),  then  Alice  should  be  more  likely  to 
send  messages  (x$  <  1/(M  +  1)),  whereas  if  Clueless*  is  more  likely  to  send 


Fig.  IT.  Value  of  xo  that  maximizes  mutual  information  for  N  —  1,2, 3, 4  clueless 
senders  and  M  —  3  receivers  as  a  function  of  p 

messages  (( p  <  1  /(M  +  1)),  then  Alice  should  be  less  likely  to  send  messages 

(z0>l/(M  +  l)). 

Figure  18  shows  the  degree  to  which  the  choice  of  x0  =  1/ ( M  + 1)  can  be  sub- 
optimal,  for  N  =  1,2, 3,4  clueless  senders  and  M  —  3  receivers.  The  plot  shows 
the  mutual  information  for  the  given  p  and  x0  =  1/(M  +  1),  normalized  by  di¬ 
viding  by  the  capacity  (maximum  mutual  information)  at  that  same  p.  Hence,  it 
shows  the  degree  to  which  a  choice  of  x0  =  1/(M  +  1)  fails  to  achieve  the  maxi¬ 
mum  mutual  information.  For  iV  =  2,  it  is  never  worse  than  0.94  (numerically), 
but  for  N  =  4,  its  minimum  is  0.88.  The  relationship  of  suboptimality  for  other 
choices  of  M  and  N ,  or  for  other  distributions,  is  not  known. 

In  Figure  19,  we  show  the  lower  bound  on  capacity  of  the  channel  as  a  function 
of  p  for  IV  —  1  clueless  sender  and  various  values  of  M  receivers.  Numerical 
results  show  that  this  lower  bound  increases  for  all  p  as  M  increases,  and  the 
lower  bound  on  the  capacity  for  a  given  M  occurs  at  p  —  1  /(M  +  1),  which  is 
indicated  by  the  dotted  lines  in  the  figure. 

For  Figure  20,  we  take  the  capacity  at  p  =  1/(M  -f  1),  which  we  found 
numerically  to  minimize  the  capacity  of  the  covert  channel,  and  plot  this  lower 
bound  for  capacity  for  many  values  of  N  and  M.  We  retain  the  assumption 
that  Xi  =  (1  —  xq)/(M  -f  1)  for  i  =  1,2,  that  is,  given  the  semi-uniform 

distribution  of  transmissions  to  the  receivers  by  the  clueless  senders,  it  is  best 
for  Alice  to  do  likewise.  Along  the  surface  where  N  =  0,  we  have  the  noiseless 
channel,  and  the  capacity  is  log(M  +  1),  which  is  also  the  upper  bound  for 
capacity  for  all  N  and  M.  The  values  along  the  surface  when  M  =  1  give  us  the 
same  values  we  derived  in  [5]. 
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Fig.  18.  Normalized  mutual  information  when  xq  =  1/4  for  N  —  1, 2, 3, 4 
senders  and  M  =  3  receivers 


clueless 
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Fig.  20.  Capacity  for  N  —  0  to  9  clueless  senders  and  M  —  1  to  10. 

5  Comments  and  Generalizations 

We  first  note  that  the  maximum  capacity  of  this  (covert)  quasi-anonymous  chan¬ 
nel  is  log(M  +  1)  for  M  distinguishable  receivers,  and  is  achievable  only  if  there 
are  no  other  senders  (N  =  0),  or  equivalently,  if  none  of  them  ever  send  (p  =  1), 

i.e.,  when  the  channel  is  noiseless. 

Here  are  some  of  the  observations  from  the  different  cases  considered,  un¬ 
der  the  semi-uniform  assumption  for  the  clueless  senders  and  the  semi-uniform 
conjecture  for  Alice,  followed  by  some  generalizations. 

1.  The  capacity  C(p,N,  M),  as  a  function  of  the  probability  p  that  a  clueless 
sender  remains  silent,  with  N  clueless  senders  and  M  receivers,  is  strictly 
bounded  below  by  C(j^ ,  N,  M),  and  is  achieved  with  x0  =  1  /(M  -f  1). 

2.  The  lower  bound  for  capacity  for  a  given  number  M  of  receivers  decreases 
as  the  number  N  of  clueless  senders  increases, 

C{jfe,N,M)  >  C(jfe,N  +  l,M). 

3.  The  lower  bound  for  capacity  for  a  given  number  N  of  clueless  senders  in¬ 
creases  as  the  number  M  of  distinguishable  receivers  increases, 
C(w+2’N’M  +  1)  >  C(jfc,N,M). 

These  observations  are  intuitive,  but  we  have  not  shown  them  to  be  true 
numerically  in  the  general  case  (we  did  for  the  case  that  M  =  1  in  [5]).  It  is 
interesting  to  note  that  increasing  the  number  of  distinguishable  receivers  in¬ 
creases  the  covert  channel  capacity,  which  in  some  sense  decreases  the  (sender) 
anonymity  in  the  system  (Alice  has  more  room  in  which  to  express  herself). 
This  is  a  bit  contrary  to  the  intuitive  view  of  anonymity  in  Mix  networks,  where 
more  receivers  tends  to  provide  “greater  anonymity.”  In  this  light,  we  note  that 
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Danezis  and  Serjantov  investigated  the  effects  of  multiple  receivers  in  statisti¬ 
cal  attacks  on  anonymity  networks  [3]*  They  found  that  Alice  having  multiple 
receivers  greatly  lowered  a  statistical  attacker’s  certainty  of  Alice’s  receiver  set. 

While  the  graphs  and  numerical  tests  support  that  the  “worst”  thing  the 
clueless  senders  can  do  is  to  send  (or  not)  with  uniform  probability  distribution 
over  the  H* ,  i  =  0, 1,  2, M ,  we  have  not  proven  this  mathematically.  Nor  have 
we  proven  that,  under  these  conditions,  the  best  Alice  can  do  is  to  send  (or  not)  to 
each  receiver  Ri  with  uniform  probability,  =  1/(M  +  1)  for  i  —  0, 1,2,...,M, 
although  the  numerical  computations  support  this.  The  proof  in  [5]  of  these 
conjectures  for  the  case  where  M  =  1  relied,  in  part,  on  the  symmetry  about 
x0  —  0.5,  which  is  not  the  case  when  M  >  1,  so  another  approach  must  be  used. 
However,  we  should  still  be  able  to  use  the  concavity/convexity  results  from  [5], 
Note  that  our  conjecture  that  the  best  that  Alice  cam  do  is  to  send  in  a  semi¬ 
uniform  manner,  and  the  results  illustrated  in  Figure  8,  seem  to  be  an  extension 
of  the  interesting  results  of  [4]. 

6  Conclusions  and  Future  Work 

This  report  has  taken  a  step  towards  tying  the  notion  of  capacity  of  a  quasi- 
anonymous  channel  associated  with  an  anonymity  netwwk  to  the  amount  of 
anonymity  that  the  network  provides.  It  explores  the  particular  situation  of  a 
simple  type  of  timed  Mix  (it  fires  every  tick)  that  also  acts  as  an  exit  firewall. 
Cases  for  varying  numbers  of  distinguishable  receivers  and  varying  numbers  of 
senders  were  considered,  resulting  in  the  observations  that  more  senders  (not 
surprisingly)  decreases  the  covert  channel  capacity,  while  more  receivers  increases 
it.  The  latter  observation  is  intuitive  to  communication  engineers,  but  may  not 
have  occurred  to  many  in  the  anonymity  community,  since  the  focus  there  is 
often  on  sender  anonymity. 

As  the  entropy  H  of  the  probability  distribution  associated  with  a  message 
output  from  a  Mix  gives  the  effective  size,  2H,  of  the  anonymity  set,  we  wonder 
if  the  capacity  of  the  residual  quasl-anonymous  channel  in  an  anonymity  system 
provides  some  measure  of  the  effective  size  of  the  anonymity  set  for  the  system  as 
a  whole.  That  is,  using  the  covert  channel  capacity  as  a  standard  yardstick,  can 
we  take  the  capacity  of  the  covert  channel  for  the  observed  transmission  charac¬ 
teristics  of  clueless  senders,  equate  it  with  the  capacity  for  a  (possibly  smaller) 
set  of  clueless  senders  with  maximum  entropy  (i.e.,  who  introduce  the  maximum 
amount  of  noise  into  the  channel  for  Alice),  and  use  the  size  of  this  latter  set 
as  the  effective  number  of  clueless  senders  in  the  system.  This  is  illustrated  in 
Figure  12,  with  the  vertical  dashed  line  showing  that  N  =  4  clueless  senders 
that  remain  silent  with  probability  p  =  0.87  are  in  some  sense  equivalent  to  one 
clueless  sender  that  sends  with  p  —  0.33. 

The  case  in  which  the  Mix  itself  injects  dummy  messages  into  the  stream  ran¬ 
domly  is  not  distinguishable  from  having  an  additional  clueless  sender.  However, 
if  the  Mix  predicates  its  injection  of  dummy  messages  upon  the  activity  of  the 
senders,  then  it  can  affect  the  channel  matrix  greatly,  to  the  point  of  eliminating 
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the  covert  channel  entirely.  We  are  also  interested  in  the  degree  to  which  the 
Mix  can  reduce  the  covert  channel  capacity  (increase  anonymity)  with  a  limited 
ability  to  inject  dummy  messages. 

In  future  work  we  will  analyze  the  situation  where  we  have  different  (and 
more  realistic)  distributions  for  the  clueless  senders.  We  are  also  interested  in 
different  kinds  of  exit  point  Mix-firewalls,  such  as  threshold  Mixes,  timed  Mixes 
(where  the  time  quantum  is  long  enough  to  allow  more  than  one  message  per 
sender  to  be  sent  before  the  Mix  fires),  timed-pool  Mixes,  and  systems  of  Mixes. 
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